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Abstract In this paper we introduce a generalization of the classical L^R^-based 
Sobolev spaces with the help of a vector differential operator P which consists of 
finitely or countably many differential operators P n which themselves are linear 
combinations of distributional derivatives. We find that certain proper full-space 
Green functions G with respect to L = P* T P are positive definite functions. Here 
we ensure that the vector distributional adjoint operator P* of P is well-defined 
in the distributional sense. We then provide sufficient conditions under which our 
generalized Sobolev space will become a reproducing-kernel Hilbert space whose 
reproducing kernel can be computed via the associated Green function G. As an ap- 
plication of this theoretical framework we use G to construct multivariate minimum- 
norm interpolants Sfx to data sampled from a generalized Sobolev function / on X. 
Among other examples we show the reproducing-kernel Hilbert space of the Gaus- 
sian function is equivalent to a generalized Sobolev space. 

Mathematics Subject Classification (2000): Primary 41 A30, 65D05; Secondary 
34B27, 41 A63, 46E22, 46E35 



1 Introduction 

There is a steadily increasing body of literature on radial basis function and other 
kernel-based approximation methods in such application areas as scattered data ap- 
proximation, statistical or machine learning and numerical solutions of partial dif- 
ferential equations. Some recent books and survey papers on kernel methods are 
E El El [9] [H] [12 [161 [17]1 - Given a set of data sites X and associated values Y sam- 
pled from a continuous function /, we use a positive definite function <P to set up 
an interpolant Sfx to approximate /. It is well-known that within the reproducing- 
kernel Hilbert space framework one can also discuss the error analysis and optimal 
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recovery of the interpolation process whenever / belongs to the related reproducing- 
kernel Hilbert space N^R 17 ) (see Section [5] and ifTTll ). However, there are still a 
number of difficulties and challenges associated with this method. Two important 
questions still in need of a satisfactory answer are: What kind of functions belong 
to a given reproducing-kernel Hilbert space? and Which kernel function should we 
utilize for a particular application ? 

In the survey paper [11] the authors give the reader some guidance for dealing 
with this problem. Others - especially statisticians - will attempt to find the "best" 
kernel function by selecting an "optimal" scaling parameter. Here such techniques 
as cross-validation and maximum likelihood estimation are often mentioned (see, 
e.g. lfT5l[T6l ). The recent paper [4| derives a new error bound of the approximation 
by the fundamental functions (Green functions) using scattered shifts. 

In [10 1 the author noted that many kernels can be embedded into the classical 
Sobolev spaces, but that is was not so clear what the difference between the various 
kernels was. As a possible approach to this, he suggested that one might scale the 
classical Sobolev space with the help of a scaling parameter integrated into the ker- 
nel function. Once this is done, we can choose the "best" scaling parameter to set up 
the interpolant that minimizes the norm of the error functional for a given set of scat- 
tered data. Examples 15 . 1 1 and 15 .21 will demonstrate that the classical Sobolev space 
can be reconstructed by different inner-products that allow us to balance the required 
derivatives by selecting various scaling parameters. The related Green functions are 
the Sobolev splines (Matern functions) with appropriate scaling parameters (see 
lfl5l ). Finally, Example 15.31 shows that the reproducing-kernel Hilbert space of the 
Gaussian function is also equivalent to a generalized Sobolev space. This general- 
ized Sobolev space has been applied in the context of support vector machines and 
the study of motion coherence (see lfT3l[T8l ). 

However, in this paper, we view this problem in a slightly different way. We hope 
to construct the reproducing kernel and the reproducing-kernel Hilbert space with 
the help of countably many differential operators {P n }™=i which are themselves lin- 
ear combinations of distributional derivatives (defined in Section HTTT l. Handling the 
vector differential operator P := (Pi,-- - ,f„, • • • ) r , we will generalize the concept 
of classical real-valued L2(K i/ )-based Sobolev space Jf n (m. d ) to be a generalized 
Sobolev space H P (R d ) with a semi-inner product (see Definition l4.41 >. 

In the following we use the notation Re(£) to denote the collection of all real- 
valued functions of the function space £. For example, Re(C(R rf )) expresses the 
collection of all real-valued continuous functions on W d . The real classical Sobolev 
space is usually defined as 




with inner product 



(f,g)jrn m := Y<( Da f> Da g)> f,geJtf n (R d ), 



a\<n 
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where (■, •) is the standard L^R'') -inner product. Our real generalized Sobolev 
space will be of a very similar form, namely 

Hp(r') := {/ e Re(L 2 (M d )) : {P n fT n=x Q U_{^ d ) and £ \\P n f\\l 2m < <»} 

with the semi-inner product 

C/>*W) := £ F«f,Kg), f,8 G H P (R rf ). 

n=l 

We further wish to connect the reproducing kernel to a (full-space) Green func- 
tion G with respect to some differential operator L, i.e., LG = 5q. Since the Dirac 
delta function 5q at the origin is just a tempered distribution in the dual space of 
the Schwartz space, the Green function should be regarded as a tempered distri- 
bution as well. We find that L can be computed by a vector differential operator 
P := (Pi,--- ,P n ) T and its distributional adjoint operator P* := (Pj*, ■ • • ,P*) , i.e., 
L = P* T P = YJj—iPjPj. The definition of the distributional adjoint operator is given 
in Section |4~T1 Under some sufficient conditions, we can prove that the Green func- 
tion G is a positive definite function according to Theorem 14. II In that case the 
reproducing-kernel Hilbert space Ng(R c/ ) related to G is well-defined in Section [3] 
and ifTTl . and its reproducing kernel K on Mr is equal to K(x,y) := G(x — y). More- 
over, Theorems 14.21 shows that N^R'') is equivalent to Hp(R c ') for some simple 
additional conditions. In the proof, we use techniques of distributional Fourier trans- 
forms defined in Section |4~2l and fl4l . as well as the classical Fourier transform. 



2 Background 

Given data sites X = {x\, ■ ■ ■ ,xyv} C M. d (which we also identify with the centers 
of our kernel functions below) and values Y = {yi,- ■ ■ ,yN} C R of a real-valued 
continuous function / on X, we wish to approximate this function / by a linear 
combination of translates of a positive definite (see Section [3Tt function <P. 
To this end we set up the interpolant in the form 

N 

^W-E^-^i). x eR f/ , (l) 

and require it to satisfy the additional interpolation conditions 

s fJ (xj)=yj, ./ 1 V. (2) 

The above system d2j is equivalent to a uniquely solvable linear system 



A^c = Y, 
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where A , x := (<P(xj - x*))^ e R NxN , c:= (c h - ■ ■ ,c N ) T and Y := (y u --- ,y N ) T . 



All of the above is discussed in detail in [17, Chapter 6.1]. 

Example 2.1. One of the best known examples that fits into this framework is the 
univariate Sobolev spline (Matern) interpolant 



and a > is called the Sobolev parameter. 

We can check that <£> is a positive definite function. Moreover, if we define P = 



{P\,P 2 ) T := (d/d\,Gl) T andP* = (P^P^) 7 ■= (-d/dx,oI) T ', then <£ is the Green 
function with respect to L = P* r P = -d 2 /dx 2 + a 2 I. 

Furthermore, the interpolant s/ t x from (Q~|i-(|2]i minimizes the norm |j-|| Sofo of all 
/ e C 1 (R) nRe(L 2 (M d )) with 



subject to the constraints s/,x(x,) = yj, j = 1,. . . ,N (see 12). 

As we will show in Section [.T2l we can in general construct a reproducing-kernel 
Hilbert space N$(R d ) from a positive definite function <t> such that the interpolant 
Sf x is the best approximation of the function / in 3\f$(R rf ) fitting the values Y on 
the data sites X. 



3 Positive Definite Functions and Reproducing- Kernel Hilbert 

Spaces 

Most of the material presented in this section can be found in the excellent mono- 
graph flTl . For the reader's convenience we repeat here what is essential to our 
discussion later on. 



3.1 Positive Definite Functions 

Definition 3.1 (Q7, Definition 6.1]). A continuous even function <P : W d — > R is 
said to be a positive definite function if, for all N EN, all pairwise distinct centers 
Si,... ,xyv € K d , and all c = (c\, ■ ■ ■ ,cn) t € R N \ {0}, the quadratic form 





N N 



Y t Y t CjC k <P{x j -x k )>0. 

7=1 *=1 
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Theorem 3.1 ([17, Theorem 6.11]). Suppose that <P G Re(C(R J )) H Li (R d ) is an 

even function and its h{(R d )-Fourier transform is (j>. Then <P is positive definite if 
and only if <P is bounded and is nonnegative and nonvanishing. 



3.2 Reproducing-Kernel Hilbert Spaces 

Definition 3.2 ([17, Definition 10.1]). Let U(R d ) be a real Hilbert space of func- 
tions / : R d -> R. A kernel K : R d x R d — > R is called a reproducing kernel for 
H(R rf ) if 

(i) £(-,y) G H(R f/ ), for a11 y G M J , 

(i7) /(y) = {K{;y)J) m *) , for ^1 / G H(R d ) and all y G R d . 

In this case H(R rf ) is called a reproducing-kernel Hilbert space. 

Theorem 3.2 ([17, Theorem 10.12]). Suppose that 4> G C(M rf ) nRe ( L i ( K£ 0) « an 
even function and its Li (R d )-Fourier transform is 0. is a positive definite func- 
tion, then 

N&(R d ) = {/ G C(M C/ ) nRe(L 2 (M d )) : <T 1/2 / G L 2 (R d )} , 
is a reproducing-kernel Hilbert space with reproducing kernel 

K{x,y) = <P(x-y), xjef', 
and its inner product satisfies 



w/zere / awafg are the L2(R d )-Fourier transform of f and g, respectively. 



4 Connecting Green Functions and Generalized Sobolev Spaces 
to Positive Definite Functions and Reproducing-Kernel Hilbert 

Spaces 

4.1 Differential Operators and Distributional Adjoint Operators 



First, we define a metric p on the Schwartz space 
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S:={yeC~(K''):Va,/3 G N#,3C aj3r > s.t. sup x$D a y(x) < C aflr } 

so that it becomes a Frechet space, where the metric p is given by 

p( 7l ,y 2 ):= £ 2-N-IHI .f^ 7 '" 72 ^ p a p(r):= sup x^ 7 (x) , 

for each 71,72,7 s §. This means that a sequence { 7 «}™ = i of § converges to an 
element 7G § if and only if x^D a y n (x) converges uniformly to x^D a 7 (x) on W 1 for 
each a,jS e Nq. Together with its metric p the Schwartz space S is regarded as the 
classical test function space. 

Let S' be the associated space of tempered distributions (dual space of S or space 
of continuous linear functionals on S). We introduce the notation 

(T,y) :=T(y), for each T e §' and 7 e S. 

For any f,g G L l ° c (Br) whose product fg is integrable on W 1 we denote a bilinear 
form by 

(/,*):= / /(xk(x)dx. 

If /,g G L 2 (1R J ) then is equal to the standard L 2 (K d ) -inner product. 

For each / G Lj (Mr) n S3 there exists a unique tempered distribution 7V G §' 
such that 

(Tf,7) = (f,Y), for each 7 eS. 

So / G Lj oc (R d ) n S3 can be viewed as an element of S' and we rewrite Tf := /. 
This means that L / 1 0C (R d ) n SJ can be embedded into S', i.e., h[ 0C (M. d ) n S3 C S'. 
The Dirac delta function (Dirac distribution) do concentrated at the origin is also 
an element of §', i.e., (Sq, 7) = 7(0) for each 7 G S (see 0U Chapter 1] and 
Chapter 11]). 

Remark 4.1. SJ denotes the collection of slowly increasing functions which grow at 
most like any particular fixed polynomial, i.e., 



S3. 



{/ G R c/ -> C : /(x) = 0(||x||£) as ||x 2 || ->■ °° for some m G N | , 



where the notation f = 0(g) means that there is a positive number M such that 
\f\<M\g\. 

First, we will define the distributional derivative P : §' — > S' by (strong) derivative 

d ^a k d 
D °"- = Wir^^ where l«h=E«fc, a:=(ai,---,a £ /) r eN^. 

According to IPT41 Chapter 1], the derivative D a is a continuous linear operator from 
S into S. Then P can be well-defined by 
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(PT,y) :=(-iy a \T,D a Y), foreachreS'andyeS. 

Since (D a yi,yi) = (— 1)1"! {Yi,D a Ji) for each 71,72 e S, the restricted operator P\§ 
and the derivative D a coincide on S. For convenience, we denote the distributional 
derivative as P :— D a (similar as (H Chapter 1]). 

Next we wish to define the distributional adjoint operator of the distributional 
derivative. In the same way as before we can introduce the linear operator P* — » 
S' by using the derivative (— l)l a l,D a , i.e., 

(P*T, 7) := (-l) |a| (r,(-l) |a| D a 7) = (T,D a y), for each T e S' and ye S. 

Moreover, the restricted operator P*|§ and the derivative (— l)^D a coincide on S. 
We call the linear operator P* the distributional adjoint operator of the distributional 
derivative P = D a . It can be written as P* = (— 1)N.P = (-l)ND a also. Usually 
people call the derivative = (— \)^D a : S — > § as the classical adjoint operator 
of the distributional derivative P = D a : §' —> §'. Here we can think of the classical 
adjoint operator P* \ § extended to be the distributional adjoint operator P* . 

Now we will define a more general differential operator and its distributional 
adjoint operator by using real linear combinations of distributional derivatives. 

Definition 4.1. The differential operator (with real-valued constant coefficients) 
P :§'—>•§' is defined as 

P'= c a° a ^ where c„£l and a € Nq, m € No. 

\a\<m 

Its distributional adjoint operator P* : §' — S- §' is well-defined by 

P* -= I (-l) lal c a D a . 

\a\<m 

To streamline terminology we will refer to differential operators (with real-valued 
constant coefficients) and distributional adjoint operators simply as differential op- 
erators and adjoint operators respectively in this article. 

It is obvious that the adjoint operator P* of the differential operator P is also a 
differential operator. We can further check that the differential operator P and its 
adjoint operator P* have the following property 

(FT, 7) = (T,P*y) and (P*T,y) = (T,Py), for each T e §' and ye §. 

Any differential operator P is also complex-adjoint invariant, i.e., 

Py = Py, for each 7 e S . 



Remark 4.2. Our distributional adjoint operator is different from the usual adjoint 
operators of bounded linear operators defined in Hilbert or Banach space. Our oper- 
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ator is formed in the dual space of the Schwartz space and it may be not a bounded 
operator if S' is defined as a metric space. But it is continuous when §' is given the 
weak-star topology as the dual of S. However, the idea of this construction is similar 
to the classical ones. Therefore we call it an adjoint as well. 

Finally, we know that the classical Sobolev spaces are defined by weak deriva- 
tives. Here we will explain some relationships between the distributional derivative 
P :=D a and the a th weak derivative. Fixing any / G Lj oe (IR c ') n S3, if there is a 
generalized function g G L l ° c (R d ) n S3 such that 

(8,7) = (-lp(.f,D a Y) = {D a f,y), for each 7 gS, 

then D a f = g is called the a' h weak derivative off. This means that distributional 
derivatives and weak derivatives are the same on the classical Sobolev space. 

Remark 4.3. In the book the definition of the weak derivative has tiny differ- 
ences from the one we use in this article. In particular, they use different test func- 
tions to derive the weak derivative. We now state their definition in order to compare 
it to the above mentioned weak derivative. Fixing any / G L l ° c (M. d ), if there is a gen- 
eralized function g G L l ° c (M. d ) such that 

{g,Y) = (-1) N (/,£>°T), for each 7 G f^W), 

then they call D a f :— g the a th weak derivative off. However, if / G L l l 0C (R d ) n S3 
and D a f£ L l ° c (M. d ) n S3, then the two definitions of weak derivatives are equivalent 
since 9(R d ) is dense in S, where @(R d ) = Cq {R d ) and its dual space ^(W 1 )' are 
defined in [ 1 Chapter 1]. In this case, we can consider the two weak derivatives as 
being the same. 



4.2 Distributional Fourier Transforms 

Denote the Fourier transform and inverse Fourier transform of any 7 G S by 

y(x) := {2n)- d l 2 f /(yK'^dy, y(x) := {2n)- d l 2 f riyW^dy, i := V^l. 

Since y belongs to S for each 7 G S and the Fourier transform map is a homeomor- 
phism of S onto itself, the distributional Fourier transform T G S' of the tempered 
distribution T E S' is well-defined by 

(f,y):={T,f), for each 7 gS. 

Since y=y for each 7 G S, we have 

{T,y) = {t,f), foreachr£S'and7GS. 
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So the Fourier transform of 7 G § is the same as its distributional transform. If 
/ G Li(R d ) or / e L 2 (K J ) then its U(R d ) -Fourier transform or L 2 (K fl ')-Fourier 
transform is equal to its distributional Fourier transform. The distributional Fourier 
transform Sq of the Dirac delta function Sq is equal to {2n)~ d l 2 . Moreover, we can 
also check that the distributional Fourier transform map is an isomorphism of the 
topological vector space §' onto itself (see lfl4l Chapter 1] and [1 Chapter 7]). 

Now we want to define the distributional Fourier transforms of differential oper- 
ators. First, we will derive a linear operator ££ : §' — >• S' using any fixed complex- 
valued polynomial p on R d . Since the linear operator y H> py is a continuous oper- 
ator from S into S, the linear operator Jz? can be well-defined by 

(_Sf T, y):=(T,py), for each T G §' and 7 G S . 

Since jSf/ = pf G h[ 0C (R d ) n SJ when / G h[ 0C (R d ) G S3, we use the notation J2 5 := 
p for convenience. 

Next we consider the case of distributional derivatives. According to [ 14, Chap- 
ter 1], we know that D a y = pf for each 7 G S, where p(x) := (/x) a is a complex- 
valued polynomial onK f/ . Hence we can verify that (PT,y) — (T,pf) for each T G §' 
and 7 G S, where P := D" and p(x) := (/x) a . This implies that = Ff = /3f for 
each T G §'. Therefore we can denote the distributional Fourier transform of a dif- 
ferential operator in the following way. 

Definition 4.2. Let P be a differential operator. If there is a complex-valued poly- 
nomial p on R d such that 

(PT,Y) = (f,py) = (pf, y) for each T G §' and 7 G S, 

then /3 is said to be the distributional Fourier transform of P. 

According to Definition 14.21 each differential operator P possesses a distribu- 
tional Fourier transform p. The complex-valued polynomial p can be written ex- 
plicitly as 

p(x) = £ c K (ix) a , where P = £ c a D a , c„£l, a e w G N . 

|a|<m |a|<m 

Moreover, since P is defined with the real-valued constant coefficients, we have 
p* = p, where p* is the distributional Fourier transform of the adjoint operator P* . 



4.3 Green Functions and Generalized Sobolev Spaces 

Definition 4.3. G is the (full-space) Green function with respect to the differential 
operator L if G G §' satisfies the equation 



LG = do. 



(1) 
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Equation (Q} is to be understood in the distributional sense which means that 
(G,L*y) = (LG,y) = (8o,Y) = 7(0) for each 7 e S. 

According to Theorem l3.1l we can obtain the following theorem. 



Theorem 4.1. Let Lbe a differential operator and its Fourier transform I be positive 
on W 1 so that t l G U(R d ). If the Green function G G C(R d ) nRe(Li(R d )) with 
respect to L is an even function, then G is a positive definite function and 

3(x):= (27i)- d/2 l(x)- 1 , XGR d . 

is the Li (W t )-Fourier transform of G. 

Proof. First we want to prove that S is the Li (W 1 ) -Fourier transform of G. The fact 
that t 1 G Li (R d ) implies that § G Li (W 1 ). If we can check that 

(G,y) = (§,y), for each y eS, 

where G is the distributional Fourier transform of G, then we can conclude that § 
is the L^R^-Fourier transform of G. Since t 1 G C°°(M rf ) and D a (T ') G SJ for 
each a G Nq, f~ 1 7 e S for each 7 G §. Hence 

(G,7> - (ZG^V) = (LG,r l y) = (^,r 1 7 ) 
= ((27r)- rf / 2 ,r 1 7) = ((2 ? r)- J / 2 ,r 1 7) = (S,7)- 

According to IfTTl Corollary 5.24], G can be recovered from its Lj (R^)-Fourier 
transform, i.e., 

G(x) = (2n)- d/2 f S(y)e ,xry dy, x G R d . 

Jm. d 

Then we have 

|G(x)| < (27t)- d f /(y)-y x ^dy < (2%y d llr 1 1|, < 00, 



IL 



which shows that G is bounded. Since § is positive on R , G is a positive definite 
function by Theorem l3.ll 

Definition 4.4. Let the vector differential operator P = (P\, - ■ ■ ,P n , • • • ) T be set up 
by countably many differential operators {P„}™ =1 . The real generalized Sobolev 
space induced by P is defined by 

Hp(r') := {/ G Re(L 2 (R^)) : {P n fT n =x Q U(R d ) and £ \\P n f\\l 2m < °°} 

«=1 

and it is equipped with the semi-inner product 

(/.*)hp(W) : = i ( P nf,P^g), f,g G H P (R rf ). 
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What is the meaning of Hp(R rf )? By the definition of the generalized Sobolev space, 
we know that Hp(R rf ) is a real-valued subspace of L^ffi^) and it is equipped with a 
semi-inner product induced by the vector differential operator P. On the other hand, 
/ € Re(L2(M )) belongs to Hp(R rf ) if and only if there is a sequence {g«}" =1 C 
L 2 (R c/ ) such that \\gn\\l 2{ Md) < - and 

(gn,7) = (gn,Y) - (Pnf,Y) = = (f^Y), for each ye S, n e N. 

In the following theorems of this section we only consider P constructed by a 
finite number of differential operators P\,,,,,P n . If P := (Pi, - ■ ■ ,P n ) , then the dif- 
ferential operator 

n 

L:=P* T P = Y, p j p j 

7=1 

is well-defined, where P* :— (Pj*, ■ ■ ■ ,P*) T is the adjoint operator of P. Furthermore, 
the distributional Fourier transform I of L can be computed as the form 

/» = £pj(x)pj(?) = £lM)PM) = £ \pA*)\ 2 = IIPWIIL 

7=1 7=1 7=1 

where p = (pi,-- - ,p n ) T is the distributional Fourier transform of P and p* = 
(PI ■■■ ,p„) T is the distributional Fourier transform of P*. Now we can obtain the 
main theorem about the space Hp(R'') induced by the vector differential operator 
P:=(P U --- ,P n ) T . 

Theorem 4.2. Let the vector differential operator be P := (P\ , • • • ,P„) T and its dis- 
tributional Fourier transform be p :~ (pi, ■ ■ ■ ,p n ) T ■ Suppose that p is nonzero on 
W 1 andx^ ||p(x) H^ 2 G Li(R d ). If the Green function G € C(M. d ) nRe(Li(R rf )) 
with respect to the differential operator L = P* T P is chosen to be an even func- 
tion, then G is a positive definite function and its related reproducing-kernel Hilbert 
space jVg(R^) is equivalent to the generalized Sobolev space Hp(W ! ), i.e., 

(/>*W') = (/.*)np(W<). f,8 e W) =H P (R d ). 

Proof. By the above discussion, the distributional Fourier transform / of L is equal to 
/(x) = ||p(x)|| 2 . Since p is nonzero on W 1 and x 1— > ||p(x)||9 2 E Li ? is positive 
on R c/ and Z~ 1 € Li (W l ). According to Theorem l4.ll G is a positive definite function 
and its Li (R^)-Fourier transform is given by 

S(x) := (2n)- d / 2 i(xy l = (2n)- d l 2 ||p(x)|| 2 - 2 , x e R d . 

With the material developed thus far we are able to construct the reproducing-kernel 
Hilbert space j\Ig(R c/ ) related to G. We remark that since / is positive on Mr, Lf = 
for some / € L.2(R c/ ) if and only if / = 0. This implies that Pf = if and only if 
/ = 0. Hence we can conclude that Hp(R rf ) is an inner-product space under this 
condition. 
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Next, we fix any / G N G (K rf ). According to Theorem [3T2l / € Re(C(R d )) n 
L 2 (K' / ) possesses an L 2 (K d ) -Fourier transform / and x n- f(x) ||p(x)|| 2 G L 2 (M. d ). 
This means that the functions pjf G L^K''), 7 = 1 , . . . ,n. Therefore we can define 

f Pj :=(PjfreL 2 (R d ), j = l,...,n, 

using the inverse L2(R rf ) -Fourier transform. 

Since /3 ; is a polynomial for each j = 1 , . . . , n, pjf G S for each 7 G S. Moreover, 

pff — pjf = P*jf — PJ 7 implies that 

(fpj,Y) = ((pjflr) = (Pjf, 7) = if, Pjf) = (f,pjr) = (f,pfr) = (f,PjY) = (Pjf,% 

This shows that Pjf = f P . G L 2 (M rf ). Therefore we know that / G Hp(M rf ) and then 
Ji G (M. d ) C H P (R c/ )- 

To establish equality of the inner products we let f,g G K G (]R rf ). Then the 
Plancherel theorem [8] yields 

n n " f - 2 

(/>s)h P (r<o = 11^^ = E(A//>A/£) = L / ^/( x )^( x ) l^( x )| dx 

= [ f(x)J{x)\\p(x)\\ 2 2 dx = [ f(x)§(x)t(x)dx 

JR d JR d 

Finally, we verify that N G (R d ) = H P (R d ). We fix any / G H P (R d ). Let / mdP~f, 
respectively, be the 1^(1^') -Fourier transforms of / and Pjf, j = l,...,n. Using the 
Plancherel theorem [8| again we obtain 

/ \f(x)pj(x)\ 2 dx = (Pjf, pjf) = (fifjpjf) = (Pjf, Pjf) < ~. 

JR d 

And therefore, with the help of the proof above, we have 

/ = {2n) d ' 2 f |/(x)| 2 /(x)dx = {2%) d l 2 f |/(x)| 2 ||p(x)idx 

JR d S (x) Jm. d JR d 

= (2K) d ' 2 j^ f \f(x)p J (x)\ 2 dx<^ 

S JR d 

showing that S" 1 / 2 / G L 2 (R d ). This means in particular that / G Li (]R rf ) because 
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Since the inverse Li -Fourier transform of / is continuous, / G C(R < '). Accord- 
ing to Theoremim / € N G (R d ) which implies that Hp(JT') C l N G (M d ). 



5 Examples of Positive Definite Kernels Generated By Green 
Functions 



5.1 One-Dimensional Cases 



With the theory we developed in the preceding section in mind we again discuss the 
univariate Sobolev splines of Example 12. II 

Example 5.1 ( Univariate Sobolev Splines ). Let a > be a scalar parameter and con- 
siderP:= (d/dx,aI) T with L := P* r P = a 2 I -d 2 /dx 2 . It is known that the Green 
function with respect to L is 

G(x) := — expf-fflxl), xeM. 

2(7 

Since P and G satisfy the conditions of Theorem |4.2| we know G is positive definite 
and that the interpolant formed by G is given by 

N 

!>f^)-=Y,CjG(x-xj), xeR. (1) 

;=i 

Formula (Q~|i denotes a Sobolev spline (or Matern) interpolant. 

Since / G H P (R) if and only if /',/ G L 2 (R rf ), J>? 1 (R) is equivalent to H P (R). 
Applying Theorem |4.2| and |fT71 Theorem 13.2] we confirm that 

1 1 S W 1 1 Hp (M) = min { 1 1 f\ I Hp (K) : / e M ' ( K ) and /( X ./ ) = X/ > J = 1 > ' ' ' ) N ] > 

i.e., is the minimum Hp (R) -norm interpolant to the data from Jf° l (R). 



5.2 d-Dimensional Cases 



Example 5.2 (Sobolev Splines). This is a generalization of Example 15.11 Let P := 
(Qo ' " " ) Qn) T w i tn a sca l ar parameter a > 0, where 



(CS)^ "hen, = 2*, 
(tSS) 172 ^ when; = 2^+1, 



QjH > J Jl(l/2 k < :•„../ 0.1 n.n d 2. 
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Here we use A° := I. With these definitions we get L := P* r P = (a 2 /- A)". 

The Sobolev spline (or Matern function) is known to be the Green function with 
respect to L (see [2 , Chapter 6. 1 .6]), i.e., 

G(X) := n^r(n)a^ {G \^r d,2 K dl2 - n (a ||x|| 2 ) , x e M d , 

where K m {-) is the modified Bessel function of the second kind of order m. Since P 
and G satisfy the conditions of Theorem l4.2l G is positive definite and the associated 
interpolant s/jc is the same as the Sobolev spline (or Matern) interpolant. 

Since / £ H P (R d ) if and only if A n > 2 fJ e L 2 (IR' / ), H v (R d ) is equivalent 
to J^ n {W') which implies that N G (M d ) and Jf"(M. d ) are isomorphic by The- 
orem |4j2] It follows that the real classical Sobolev space Jtff"(M. d ) becomes a 
reproducing-kernel Hilbert space with Hp (W 1 ) -inner-product and its reproducing 
kernel is K(x,y) :=G(x — y). 

In the following example we are not able to establish that the operator P satisfies 
the conditions of Theorem l4.2l and so part of the connection to the theory developed 
in this paper is lost. We therefore use the symbol <P to denote the kernel instead of 
G. 

Example 5.3 (Gaussians). The Gaussian kernel K(x,y) := <P(x — y) based on the 
Gaussian function <P is very important and popular in current research fields such as 
scattered data approximation and machine learning. Many people would therefore 
like to better understand the reproducing-kernel Hilbert space associated with the 
Gaussian function. In this example, we will show that the reproducing-kernel Hilbert 
space of the Gaussian function is equivalent to a generalized Sobolev space. 
We first consider the Gaussian function 

*(*) : = Jpexpt-^N^), xeR rf , <*>0 

We know that <J> is a positive definite function and its Li (W 1 ) -Fourier transform is 
given by (see [5 Chapter 4]) 

0(x) = (2^)-"/ 2 exp^-M|^ xeM *. 

According to Theorem l3.2l the reproducing-kernel Hilbert space of is given by 

N {R d ) = {/ € C{R d ) HRe^O^)) : <T 1/2 / G L 2 (K d )} , 
and its inner-product is equal to 
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where f,g are the L2(M. d ) -Fourier transforms of f,g G 3Nf$(R''), respectively. 
LetP:=(Qj,---,Q^---) r , where 



' A K whenn = 2A, 



Q; . { y2 A EM,, (2) 



(^^k) A*Vwhenn = 2*+l, 



Here we again use 4° := /. Now we will verify that W$(M c/ ) is equivalent to 
Hp(M c/ ). Even though we find that P does not satisfy the conditions of Theorem l4.2l 
we are still able to use other techniques in order to combine the results of this paper 
to complete the proof. 

LetP„ := (Qo,--- ,Q[,) T andL„ :=P^ r P„ for each n G N. We choose the Green 
function G„ with respect to L„, which is the inverse Li(M c/ ) -Fourier transform of 
(27r)- (/ /2f-i when n > d/2. Therefore P„ and G„ satisfy the conditions of Theo- 
rem !4.2l This tells us that - as in Examples 15. II and !5. 21 - Hp n (M ) is equivalent to 
the classical Sobolev space Jt°"(M. d ) for each n e N. Theorem l4.2l further tells us 
that 

Ji Gn (R d ) = H P „ {R d ), when n > d/2. 



Furthermore, we can verify that 



/ G Hp(R d ) /en: =1 H P „(]R rf )andsup||/|| Hp {Rd) 



which implies that ||/|| HRi (Ri) -> ll/ll Hp (Hrf) as n ~> °°- 

Let / G N<p(R d ) and / be the L 2 (M. d ) -Fourier transform off. We can check that 
toll! < • < IIP«toll2 < - < (2^)-'/ 2 0(x)- 1 and ||p„(x)|| 2 -> (2n)- d / 2 $(x)-' 
as n -> oo. Hence, §,7 1/2 / G L 2 (K d ) which implies that / e K Gn (M J ) by Theo- 
rem [3T2] According to the Lebesgue monotone convergence theorem [l \ and Theo- 
rem |4j2] we have 



d S„(x) 



2™L /„, |/( X )| 2 '«( X ) dx = Um / |/( X )| 2 ||Pn( X )ll2 dx 



I A |2 

(2^/2/ 1^1^=11/112 _ 



Therefore / e H P (R rf ) and H/H^^ = ||/|| H p(R<*y 

On the other hand, we fix any / G Hp(M rf ). Then / € H P)i (W 1 ) for each n € N. 
We again use the Lebesgue monotone convergence theorem [ 1 ] and Theorem l4.2l to 
show that 
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/ %^dx= (27T) rf / 2 lim / |/(x)| 2 ||p„(x)||2dx=(27r) rf /2lim / |/(x)| 2 f„(x)dx 



!/(*) 



2 



lim / V } n dx = (2^) d / 2 lim \\f\\L mi , 
(2^)^1^11/11^^^(2^)^11/11^^, 



< °° 



which establishes that §~ l l 2 f G L 2 (K rf ), and therefore / £ W*(R d ). 

Summarizing the above discussion, it follows that the reproducing-kernel Hilbert 
space of the Gaussian kernel is given by the generalized Sobolev space Hp(K f/ ), i.e., 

3\f$(R c ') = Hp(M <r/ ). 

Remark 5.1. If / G ^(R^) = H P (R d ), then / g Jt n (R d ) for each n € N. Accord- 
ing to the Sobolev embedding theorem we have N$(R rf ) C C^(R d ). On the 
other hand, if a function / G Re(C^ (R^)) satisfies ||£> a /|| Loo ( R rf) < Cl a l for some 
positive constant C and each a € Nq, then / G Hp(R rf ) = N*(R <i ). Moreover, if 
we replace the test functions space to be 2$(R d ), then we can further think of the 



Gaussian function <P is a (full-space) Green function of L :— exp(— j^A), i.e., 



L<P = 8q and <£,<5o€^(l 



4a 2 



6 Extensions and Future Works 

In another paper [6 1, we generalize the results of this paper in several ways. 

Instead of being limited to differential operators we allow general distributional 
operators which, e.g., are allowed to be differential operators with non-constant co- 
efficients. In that case the (full-space) Green functions and the generalized Sobolev 
spaces can be constructed by the vector distributional operators in a similar way. 

In addition, we extend all the results from positive definite functions and their 
reproducing-kernel Hilbert space with respect to a vector differential operators to 
conditionally positive definite functions of some orders and their native space with 
respect to vector distributional operators. In this case the real generalized Sobolev 
space will be rewritten as the following from 

H P (R d ) := {/ G Re(L'r(R d ))n§3 : {P„/}~ =1 Q L 2 (R d ) and £ \\P n f\\t 2m < °°} 

«=1 

and it is equipped with the semi-inner product 

(/»*)Hp(tf) : = E i p nf^g), f,8 e H P (R rf ), 

n=\ 
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where P := (P\, ■ ■ ■ ,P„,---) T is a vector distributional operator (see |6ll). 

For example, if P := ((0 T d m /dx%, ■ • • , G> T D a ,- ■ ■ , a> T d m / dx™) T which is set up 
by the differential operators (with the non-constant coefficients), then 

H P (R d ) := {/ e Re(L / 1 " c (R'')) n S3 : co T D a f e L 2 (R d ), \a\ =m,ae Nq| , 

where £» T (x) := | |x 1 1 ^ andO < T < 1. 

In the work presented here and in [6 1 we do not specify any boundary conditions 
for the Green functions. Thus there may be many different choices for the Green 
function with respect to one and the same differential operator L. In our future work 
we will apply a vector differential operator P := (Pi , • • • ,P„) T and a vector boundary 
operator B := (B\,--- ,B S ) T on a bounded domain Q to construct a reproducing 
kernel and its related reproducing-kernel Hilbert space (see [7 1). We further hope to 
use the distributional operator L to approximate the eigenvalues and eigenfunctions 
of the kernel function with the hope of obtaining fast numerical methods to solve 
the interpolating systems (Q]i-© similar as in ifTTl Chapter 15]. 

Finally, we only consider real-valued functions for the definition of our general- 
ized Sobolev spaces and their Green functions in this paper. However, all conclu- 
sions and all theorems can be extended to complex-valued functions similar as was 
done in IfTTl . 
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